Essential Pentaho ETL by Gowda Aryan Kavan

Author: Gowda, Aryan Kavan
Language: eng
Format: epub
Published: 2020-12-24T00:00:00+00:00


Start the Pentaho server

Action Plan

Configure your PDI client (Spoon) to connect to the PDI server.

Configure the KETTLE_HOME directory.
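As a rough illustration of what configuring KETTLE_HOME affects, the sketch below works out where PDI would be expected to look for its kettle.properties file. It assumes the usual convention that PDI uses a .kettle folder under KETTLE_HOME when that variable is set, and under the user's home directory otherwise. The function name and its parameters are hypothetical and are not part of PDI.

```python
import os
from pathlib import Path


def kettle_properties_path(env=None, user_home=None):
    """Return the expected location of kettle.properties (hypothetical helper)."""
    env = os.environ if env is None else env
    # Assumption: with KETTLE_HOME set, PDI reads <KETTLE_HOME>/.kettle;
    # without it, PDI falls back to <user home>/.kettle.
    base = env.get("KETTLE_HOME") or user_home or str(Path.home())
    return Path(base) / ".kettle" / "kettle.properties"


if __name__ == "__main__":
    # Print the location implied by the current environment.
    print(kettle_properties_path())
```

Running the script before and after setting KETTLE_HOME is a quick way to check which configuration folder Spoon is likely to pick up.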

Chapter 4: Dealing with Data

In this chapter you’ll learn about the following:

Reading files using PDI Spoon

Reading tables using PDI Spoon

Reading REST API data using PDI Spoon

PDI provides steps for dealing with many different data formats. In Spoon, the steps are grouped by category (input, output, transformation, streaming, statistics, big data, scripting, data warehouse, and bulk loading), and together they allow you to read, write, and transform structured, semi-structured, and unstructured data. In this chapter, you will learn not only the basics of reading and writing data, but also the practical how-tos for working with it.


